Poster: Alex.brollo Date: May 2, 2014 6:05am
Forum: forums Subject: Proposal: a "wikisource" collection

To deepen the cooperation between archive.org and wikisource, a great step would be to create a "wikisource" collection, which would bring multiple advantages (for both projects, IMHO).

But there's a preliminary question: can an item be listed in multiple collections? I.e., can an item be listed both in opallibriantichi and in a possible wikisource collection?

Alex

Poster: Nemo_bis Date: May 2, 2014 6:39am
Forum: forums Subject: Re: Proposal: a 'wikisource' collection

Yes, multiple collections are ok. There are even some collections where every item is automatically added to 1-2 additional (parent) collections.

Poster: Alex.brollo Date: May 2, 2014 8:02am
Forum: forums Subject: Re: Proposal: a 'wikisource' collection

I saw that there are hierarchical sub-collections (e.g. opallibriantichi is a sub-collection of additional collections); from your answer I hope that multiple collections are OK even when they share the same level (i.e. that an item can be assigned to both opallibriantichi and wikisource, both being sub-collections of additional collections).

Is there a list somewhere of wikisource users who also have sysop privileges on IA?

Poster: Jeff Kaplan Date: May 2, 2014 10:22pm
Forum: forums Subject: Re: Proposal: a 'wikisource' collection

not sure exactly what you have in mind. we do have https://archive.org/details/wikicollections and its subcollections.

Poster: Nemo_bis Date: May 3, 2014 8:09am
Forum: forums Subject: Re: Proposal: a 'wikisource' collection

Jeff, thanks for the reply. For a start, a "wikisource" collection could just be a list of items used on wikisource.org. The archive.org interface would be a nice way to browse "our" DjVu files in a prettier way, even if there are no links etc.

If such a collection makes sense to you, I can provide a list of about 10 thousand identifiers and I can volunteer to be the first admin of the collection.

Poster: Alex.brollo Date: May 3, 2014 7:19am
Forum: forums Subject: Re: Proposal: a 'wikisource' collection

Thanks for your interest, Jeff; no, I have something different in mind, much simpler. I'm thinking simply of creating a "horizontal collection" named wikisource, grouping the IA books that are used as sources for wikisource. As perhaps you know, wikisource does something that can be seen as the "final derive step" for digitized books, i.e. it converts OCR text into a good (we hope) hypertext, fixing scannos, adding formatting and adding internal and external links. By simply adding a "collection:wikisource" tag to IA items that have been used, or are in use, by wikisource, several advantages would follow:
1. it would be simple to get a list of such items;
2. their metadata could be verified, aligned and refined by the admins of the collection, using batch routines too;
3. both projects could gain active users;
4. once wikisource has a good HTML hypertext, which at present can be exported as PDF or ePub, a copy of that final result could be uploaded to the file list of the source IA item.

At present I'm uploading many books into the opallibriantichi collection (869 today); I'll certainly add metadata and a link to any of them that is used in a proofreading procedure on wikisource; but this subgroup of opallibriantichi+wikisource items will be only a small fraction of the IA items used by wikisource projects, which are spread across a variety of different collections and managed by a variety of admins.

Recently there's a new, bold project - wikidata - collecting and organizing metadata about anything in the wiki projects, and there's a "wikidata book task force" working on book metadata; a good data alignment between wikidata and IA would also be very interesting and useful.

Poster: Jeff Kaplan Date: May 3, 2014 9:14am
Forum: forums Subject: Re: Proposal: a 'wikisource' collection

still a bit unclear, are you suggesting modifying files in existing items or copying files, modifying them and making them available separately?

also, we would not add files to existing items unless they are from the original uploader. any modified or new files would need to be uploaded as separate, new items.

you can do this with any publicly available items. and, you could simply add a subject tag unique to your project that would make finding them through search simple.

Poster: Alex.brollo Date: May 3, 2014 3:16pm
Forum: forums Subject: Re: Proposal: a 'wikisource' collection

Just as a test, I edited the metadata of item RimeFrancescoPetrarca, adding a subject:wikisource entry and a link to it.wikisource in the notes field.

As you said, this is sufficient to retrieve items used by wikisource and to get a backlink to the wikisource version; it could be a first step.
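
Retrieving the items tagged this way could be scripted roughly as follows. This is only a minimal sketch, assuming the public advancedsearch.php endpoint on archive.org and its q, fl[], rows and output parameters; the function name build_search_url is hypothetical and chosen here for illustration. It only builds the URL and does not contact the server.

```python
from urllib.parse import urlencode


def build_search_url(subject, rows=50):
    """Build an archive.org advanced-search URL for items with a given subject.

    Assumption: the advancedsearch.php endpoint accepts q, fl[], rows
    and output as query parameters.
    """
    params = [
        ("q", f'subject:"{subject}"'),  # search on the subject field
        ("fl[]", "identifier"),         # return only item identifiers
        ("rows", str(rows)),            # maximum number of results
        ("output", "json"),             # ask for a JSON response
    ]
    return "https://archive.org/advancedsearch.php?" + urlencode(params)


print(build_search_url("wikisource"))
```

Opening the printed URL in a browser, or fetching it with any HTTP client, should list the identifiers of items carrying the subject tag.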

Poster: Nemo_bis Date: Jun 24, 2014 7:18am
Forum: forums Subject: Re: Proposal: a 'wikisource' collection

We don't plan to change or reupload any file, only to have easier (read) access to IA items used by Wikisource. For instance "list Persian language books used on Wikisource", in order to run OCR on them (which ABBYY doesn't do): trivial with internal search, a bit harder otherwise. (Most Wikisource users are not very technical.)

Even a "Wikisource" keyword, or any other searchable metadata field, would be wonderful as well.
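
The "list Persian language books used on Wikisource" example could become a single search query once items carry such a keyword. A minimal sketch, assuming items are tagged with subject "wikisource" and that the language field holds the value "Persian" (an assumption: real items may record the language differently, for example as a code); build_query is a hypothetical helper name.

```python
def build_query(fields):
    """Join field/value pairs into one search query string using AND."""
    # Quote each value so that multi-word values stay a single phrase.
    clauses = [f'{name}:"{value}"' for name, value in fields.items()]
    return " AND ".join(clauses)


# Hypothetical field values; adjust them to what the items actually contain.
query = build_query({"subject": "wikisource", "language": "Persian"})
print(query)
```

The resulting string can be pasted into the archive.org search box or passed as the query of a scripted search.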